Search CORE

32 research outputs found

Value Iteration for Long-run Average Reward in Markov Decision Processes

Author: A Komuravelli
A McIver
AF Veinott
AK McIver
C Baier
C Courcoubetis
J Filar
K Chatterjee
K Chatterjee
K Chatterjee
K Chatterjee
M Duflot
M Kwiatkowska
M Kwiatkowska
M Kwiatkowska
ML Puterman
O Michael
RA Howard
S Giro
S Haddad
T Brázdil
T Brázdil
T Brázdil
Publication venue
Publication date: 31/08/2017
Field of study

Markov decision processes (MDPs) are standard models for probabilistic systems with non-deterministic behaviours. Long-run average rewards provide a mathematically elegant formalism for expressing long term performance. Value iteration (VI) is one of the simplest and most efficient algorithmic approaches to MDPs with other properties, such as reachability objectives. Unfortunately, a naive extension of VI does not work for MDPs with long-run average rewards, as there is no known stopping criterion. In this work our contributions are threefold. (1) We refute a conjecture related to stopping criteria for MDPs with long-run average rewards. (2) We present two practical algorithms for MDPs with long-run average rewards based on VI. First, we show that a combination of applying VI locally for each maximal end-component (MEC) and VI for reachability objectives can provide approximation guarantees. Second, extending the above approach with a simulation-guided on-demand variant of VI, we present an anytime algorithm that is able to deal with very large models. (3) Finally, we present experimental results showing that our methods significantly outperform the standard approaches on several benchmarks

arXiv.org e-Print Archive

Crossref

On a Multistage Stochastic Linear Program

Author: AF Veinott Jr.
AF Veinott Jr.
FR Gantmakher
K Sladky
RA Howard
RC Grinold
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1994
Field of study

Crossref

Maximum-Stopping-Value Policies in Finite Markov Population Decision Chains

Author: Arthur F. Veinott
B. Curtis Eaves
Bellman R
Bertsekas DP
d'Epenoux F
Derman C
Dynkin EB
Howard RA
Veinott AF
Veinott AF
Publication venue: 'Institute for Operations Research and the Management Sciences (INFORMS)'
Publication date
Field of study

Crossref

Planungsmodelle zur Nutzung der Flexibilität Kombinierter Eil- und Normalmassnahmen

Author: AF Veinott
G Kässmann
HM Wagner
K Spicher
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1989
Field of study

Crossref

Literaturverzeichnis

Author: A Veinott
AF Veinott Jr
AF Veinott Jr.
BA Kalymon
BG Kingsman
D Hochstädter
D Iglehart
E Zabel
F Kolberg
H Bauer
H Klemm
KJ Arrow
M Athans
S Karlin
T Fabian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1978
Field of study

Crossref

An Overview of Inventory Systems with Several Demand Classes

Author: A Kaplan
AF Veinott Jr.
AF Veinott Jr.
AY Ha
AY Ha
C Henaux
D Atkins
DM Topkis
EA Silver
EL Porteus
HL Lee
L Moon
LW Robinson
MA Cohen
MJ Kleijn
P Melchiors
PJ Held
R Dekker
RH Teunter
RV Evans
S Nahmias
V Nguyen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1999
Field of study

Crossref

EUR Research Repository

Computing Optimal Cycles of Homology Groups

Author: A Zomorodian
A Zomorodian
AF Veinott Jr
C Chen
G Fermi
HM Berman
J Munkres
T Kaczynski
TK Dey
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The functional equations of undiscounted denumerable state Markov renewal programming

Author: A Federgruen
A Federgruen
A Hordijk
A Hordijk
AF Veinott
D Blackwell
E Mann
E Çinlar
H Zijm
KL Chung
PJ Schweitzer
PJ Schweitzer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1986
Field of study

Crossref

The newsvendor model revisited: the impacts of high unit holding costs on the accuracy of the classic model

Author: AF Veinott
AF Veinott
DP Heyman
EL Porteus
EL Porteus
EL Porteus
G Hadley
H Krishnan
Hong Yan
I Correia
Jacqueline Wenjie Wang
JAV Mieghem
KJ Arrow
KJ Arrow
KJ Arrow
M Ferguson
M Olivares
M Shi
NC Petruzzi
P Berling
RV Evans
S Li
Shaolong Tang
SJ Erlebacher
Stella Cho
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Network Planning Using Two-Stage Programming under Uncertainty

Author: A Prékopa
A Prékopa
A Prékopa
A Prékopa
A Prékopa
AF Veinott
AJ Hoffman
C Borell
CR Scherer
D Gale
GB Dantzig
P Kall
R Billinton
R Wets
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1980
Field of study

Crossref